Information filtering via hill climbing, wordnet, and index patterns

نویسندگان

  • Kenrick J. Mock
  • V. Rao Vemuri
چکیده

The recent explosion in Internet growth has left many users awash in a sea of information, and has spurred the need for intelligent filtering systems. This paper describes work implemented in the INFOS (Intelligent News Filtering Organizational System) filtering system that is designed to reduce the user's search burden by automatically categorizing data as relevant or irrelevant based upon user interests. These predictions are learned automatically based upon features taken from input data articles and collaborative features derived from other users. The actual filtering is performed via a hybrid technique that combines a keyword-based hill climbing method, the knowledge-based conceptual representation of WordNet, and partial parsing via index patterns. A hybrid system integrating all approaches combines the benefits of each while maintaining robustness and scalability. INFOS has been tested upon Usenet news articles and preliminary tests have been performed on WWW pages.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Intelligent Information Filtering via Hybrid Techniques : Hill Climbing , Case - Based Reasoning , Index Patterns , and Genetic Algorithms

As the size of the Internet increases, the amount of data available to users has dramatically risen, resulting in an information overload for users. This work shows that information overload is a problem, and that data is organized poorly by existing browsers. To address these problems, an intelligent information news filtering system named INFOS (Intelligent News Filtering Organizational Syste...

متن کامل

Adaptive User Models for Intelligent Information Filtering

As networked systems grow in size, the amount of data available to users has increased dramatically. The result is an information overload for the user. In this project, an intelligent information filtering system reduced the user's search burden by automatically eliminating incoming data predicted to be irrelevant. These predictions are learned by adapting an internal user model which is based...

متن کامل

Wrappers for Feature Subset Selection in CRF-based Clinical Information Extraction

We present our methodology and the results for Task 1 of the CLEF eHealth Evaluation Lab 2016. This benchmark addresses clinical information extraction related to nursing shift changes, whereas the challenge is to maximize the correctness in structuring written free-text records by automatically identifying relevant text-snippets. Our approach is focused on the exploration of relevant features ...

متن کامل

Analysis of En-route Filtering Scheme for Information Reporting in Wireless Sensor Networks

In wireless sensor networks, adversaries can injectfalse data reports via compromised nodes and launch DoS attacks against legitimate reports. Recently, a number of filtering schemes against false reports have been proposed. However, they either lack strong filtering capacity or cannot support highly dynamic sensor networks very well. Moreover, few of them can deal with DoS attacks simultaneous...

متن کامل

Parçacık süzgeci ile biyomedikal görüntü zaman serisi çakıştırma Biomedical image time series registration with particle filtering

We propose a family of methods for biomedical image time series registration based on Particle filtering. The first method applies an intensity-based information-theoretic approach to calculate importance weights. An effective second group of methods use landmark-based approaches for the same purpose by automatically detecting intensity maxima or SIFT interest points from image time series. A b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Process. Manage.

دوره 33  شماره 

صفحات  -

تاریخ انتشار 1997